Creators/Authors contains: "Xia, Wen"

  1. Deep neural networks (DNNs) have gained considerable attention in various real-world applications due to their strong performance on representation learning. However, a DNN typically must be trained for many epochs to reach high inference accuracy, which requires storing successive versions of the network and releasing updated versions to users. The resulting storage and network demands significantly hamper DNN deployment on resource-constrained platforms (e.g., IoT devices, mobile phones). In this paper, we present a novel delta compression framework called Delta-DNN, which efficiently compresses the floating-point numbers in DNNs by exploiting the similarity among floats that arises during training. Specifically, (1) we observe high similarity between the floating-point numbers of neighboring versions of a neural network during training; (2) inspired by delta compression techniques, we record only the delta (i.e., the differences) between two neighboring versions instead of storing the full new version; (3) we apply error-bounded lossy compression to the delta data to achieve a high compression ratio, where the error bound is strictly constrained by an acceptable loss in the DNN's inference accuracy; (4) we evaluate Delta-DNN in two scenarios: reducing network traffic when releasing DNNs and saving the storage space occupied by multiple versions of DNNs. Experimental results on six popular DNNs show that Delta-DNN achieves a compression ratio 2x~10x higher than state-of-the-art methods, without sacrificing inference accuracy or changing the neural network structure.
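     The core idea behind steps (2) and (3) can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: it computes the delta between two neighboring weight checkpoints and quantizes it with a uniform step of twice the error bound, in the style of SZ-like error-bounded lossy compressors. The function names, parameters, and the choice of NumPy here are hypothetical.

     ```python
     import numpy as np

     def quantize_delta(old_w, new_w, error_bound):
         """Quantize the element-wise delta (new_w - old_w) into integer codes.

         Uniform linear quantization with step 2 * error_bound guarantees that
         the reconstruction error of every weight stays within error_bound.
         """
         delta = new_w - old_w
         return np.round(delta / (2.0 * error_bound)).astype(np.int32)

     def reconstruct(old_w, codes, error_bound):
         """Rebuild the new version from the old version plus the decoded delta."""
         return old_w + codes.astype(old_w.dtype) * (2.0 * error_bound)

     # Toy usage: two neighboring "checkpoints" whose weights drift only slightly.
     rng = np.random.default_rng(0)
     w_old = rng.normal(size=1000).astype(np.float32)
     w_new = w_old + rng.normal(scale=1e-3, size=1000).astype(np.float32)

     eb = 1e-3  # per-weight error bound, tied to an acceptable accuracy loss
     codes = quantize_delta(w_old, w_new, eb)  # mostly 0 / +-1, so highly repetitive
     w_rec = reconstruct(w_old, codes, eb)
     assert np.abs(w_rec - w_new).max() <= eb + 1e-6  # error strictly bounded
     ```

     In a full pipeline, the integer codes would then be handed to a lossless back end (e.g., an entropy coder), which is where the high compression ratio comes from: weights of neighboring training epochs differ only slightly, so most codes are zero or near zero.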